A method for inferring compositional hierarchies from strings, useful for recognizing lexical structure in long sequences. Examples, source code, description, publications, and an on-line demonstration.
ANSI C implementation of E. Ukkonen's algorithm that makes it convenient to do string matching against a data set in O(N) time. Includes source code, an interface in Perl, and a write-up.